Astronomy & Astrophysics manuscript no. 
(will be inserted by hand later) 



Smooth maps from clumpy data 



Marco Lombardi and Peter Schneider 



o 
o 

(N 

Ph. 
< 



> 

(N 
m 

o 
o 

:^ 

Oh 
O 



X 



Institiit fiir Astrophysik und Extraterrestrische Forschung, Universitat Bonn, Auf dem Hiigel 71, D-53121 Bonn, Germany 
Received ***(jjjg***. accepted ***(jate*** 

Abstract. We study an estimator for smoothing irregularly sampled data into a smooth map. The estimator has been widely 
used in astronomy, owing to its low level of noise; it involves a weight function - or smoothing kernel - w{9). We show that this 
estimator is not unbiased, in the sense that the expectation value of the smoothed map is not the underlying process convolved 
with w, but a convolution with a modified kernel Wcff (0). We show how to calculate Wea for a given kernel w and investigate its 
properties. In particular, it is found that (1) is normalized, (2) has a shape 'similar' to the original kernel w, (3) converges 
to w in the limit of high number density of data points, and (4) reduces to a top-hat filter in the limit of very small number 
density of data points. Hence, although the estimator is biased, the bias is well understood analytically, and since w^b has all 
the desired properties of a smoothing kernel, the estimator is in fact very useful. We present explicit examples for several filter 
functions which are commonly used, and provide a series expression valid in the limit of large density of data points. 
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1. Introduction 

A common problem in astronomy is the smoothing of some ir- 
regularly sampled data into a continuous map. It is hard to list 
all possible cases where such a problem is encountered. Just to 
give a few examples, we mention the determination of the stel- 
lar distribution of our Galaxy, the mapping of column density 
of dark molecular clouds from the absorption of background 
stars, the determination of the distribution function in globu- 
lar clusters from radial and, recently, proper motions of stars, 
the determination of cosmological large-scale structures from 
redshift surveys, and the mass reconstruction in galaxy clus- 
ters from the observed distortion of background galaxies using 
weak lensing techniques. Similarly, one-dimensional data, such 
as time series of some events, often need to be smoothed in or- 
der to obtain a real function. 

The use of a smooth map is convenient for at least two rea- 
sons. First, a smooth map can be better analyzed than irregu- 
larly sampled data. Second, if the smoothing is done in a suf- 
ficiently coarse way, the smooth map is significantly less noisy 
than the individual data. The drawback related to this last point 
is a loss in resolution, but often this is a price that we have 
to pay in order to obtain results with a decent signal-to-noise 
ratio. 

In many cases the transformation of irregularly sampled 
data into a smooth map follows a standard approach. A posi- 
tive weight function, describing the relative weight of a datum 
at the position 9 + 4>on the point 6, is introduced. This function 
is generally of the form 'w{4>), i.e. is independent of absolute 



position 6 of the point, and actually often depends only on the 
separation \cf)\. The weight function w{cf)) is also chosen so that 
its value is large when the datum is close to the point, i.e. when 
\4>\ is small, and vanishes when \cf)\ is large. Then, the data are 
averaged using a weighted mean with the weights given by the 
function w. More precisely, calling /„ the n-th datum obtained 
at the position 0„, the smooth map is defined as 



he) = 



ErMe~On) 



(1) 



It is reasonable to assume that this standard approach works 
well and produces good results, and the frequent use of this 
estimator suggests this is the case. On the other hand, in our 
opinion the properties of the smoothing should be better char- 
acterized by means of rigorous calculations. Some authors have 
actually already studied the smoothing using some approx- 
imations, and have obtained some preliminary results (e.g., 
Lombardi & Bertin 1998, van Waerbeke 2000). To our knowl- 
edge, however, the general problem has not been fully ad- 
dressed so far and in particular there are no exact results known. 

In this paper we consider in detail the effect of the smooth- 
ing on irregularly sampled data and derive a number of exact 
properties for the resulting map. We assume that the measure- 
ments /„ are unbiased estimates of some unknown field f{9) 
at the positions 0„, and we study the expectation value of the 
map f{9) using an ensemble average, i.e. taking the N posi- 
tions {On} as random variables. We then show that the expec- 
tation value for the smooth map of Eq. ([l|) is given by 
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(/») = / f{e')w,s[e~e')A^e' 



(2) 
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Thus, (/) is the convolution of the unknown field / with an 
effective weight Wcs which, in general, differs from the weight 
function w. We also show that WcS has a "similar" shape as 
w and converges to w for a large number of objects N, but in 
general Wcff is broader than w. Moreover, the effective weight 
is normalized, so that no signal is "lost" or "created." Finally, 
we obtain some analytical expansions for Wcs, and investigate 
its behavior in a number of interesting cases. 

A common alternative to Eq. (|l]) is a non-normalized 
weighted sum, defined as 



1 ^ 



(3) 



where w has been assumed to have unit integral [see below 
Eq. (^] . The statistical properties of this estimator can be easily 
derived, and in particular we obtain for its expectation value [cf. 
Eq. (§] 



(/>))= / f{e')w{e-e')d'e' . 



However, we note that this non-normalized estimator is ex- 
pected to be more noisy than the one defined in Eq. because 
of sampling noise. For example, even in the case of a flat field 
f{9) = 1 measured without errors (so that /„ = 1) we expect 
to have a noisy map if we use Eq. (^. For this reason, whenever 
possible the estimator (|l]) should be used instead. 

The paper is organized as follows. In Sect. ^ we derive 
some preliminary expressions for the mean value of the map of 
Eq. (|l]). These results are generalized to a variable number N 
of objects in Sect. |[ If the weight function w{9) is allowed to 
vanish, then some peculiarities arises. This case is considered 
in detail in Sect. Q Section || is dedicated to general properties 
for the average of / and related functions. In Sect. || we take 
an alternative approach which can be used to obtain an analytic 
expansion for /. In Sect. ^ we consider three specific examples 
of weight functions often used in practice. Finally, a summary 
of the results obtained in this paper is given in Sect. ^ A vari- 
ation of the smoothing technique considered in this paper is 
briefly discussed in Appendix ^ 

2. Definitions and first results 

Suppose one wants to measure an unknown field f{9), a func- 
tion of the "position" 6. [What 6 really means is totally irrel- 
evant for our discussion. For example, 9 could represent the 
position of an object on the sky, the time of some observation, 
or the wavelength of a spectral feature. In the following, to fo- 
cus on a specific case, we will assume that 9 represents a posi- 
tion on the sky and thus we will consider it as two-dimensional 
variable.] Suppose also that we can obtain a total of N unbiased 
estimates /„ for / at some points { 0„ } , and that each point can 
freely span a field il of surface A (O represents the area of the 
survey, i.e. the area where data are available). The points { 0,i } , 
in other words, are taken to be independent random variables 
with uniform probability distribution and density p = N/A in- 
side the set 57 of their possible values. We can then define the 



smooth map of Eq. (|l]), or more explicitly 



En=l frMO - On) 



f{0) 



9r^ 



(5) 



In the rest of this paper we study the expectation value {f{9)') 
of f{9) (an alternative weighting scheme is briefly discussed in 
Appendix|A|). To simplify the notation we will assume, without 
loss of generality, that the weight function ■w{9) is normalized, 
i.e. 



/ w{9)d'^e = 1 
Jo, 



(6) 



In order to obtain the ensemble average of / we need to 
average over all possible measurements at each point, i.e. /„, 
and over all possible positions { 9n } for the N points. The first 
average is trivial, since f{6) is linear on the data /„ and the 
data are unbiased, so that (/„) = f{9n). We then have 



(4) Uie)) = ^ / d^e, I d 



Ell f{en)w{9 - 0„) 



N- 



Q. 



(7) 



Relabeling the integration variables we can rewrite this expres- 
sion as 



(/») - 



N 



X 



SI 



d?6 



n 



N- 



f{9i)w{9-9i) 



We now define a new random variable 

N 

y{e) = Y,w{e-9.^) . 

n=2 



(8) 



(9) 



Note that the sum runs from n = 2to n = N. Let us call Py{y) 
the probability distribution for y{9). If we suppose that 9 is 
not close to the boundary of ft, so that the support of w{9 — 
0') (i.e. the set of points 9' where w{0 — 9') ^ 0) is inside 
57, then the probability distribution for y{9) does not depend 
on 9. We anticipate here that below we will take the limit of 
large surveys, so that Vt tends to the whole plane, and A ^ oo, 
N ^ oo, such that p = N/A remains constant. Since, by 
definition, the weight function is assumed to be non-negative, 
Py{y) vanishes for y < 0. Analogously, we call Pw{w) the 
probability distribution for the weight w. These two probability 
distributions can be calculated from the equations 



n 



1 



^ Jn 

oo 

dw2Pw{w2) i dW3P^u{w3) ■ ■ 







(10) 



Wn) 







X / dwNp^{wN)S{y - W2 
Jo 



wn), (11) 
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where S is Dirac's distribution and where we have called w„ = 
w{On). Note that Eqs. (|lO|) and ([n|) hold only if the N points 
{On} are uniformly distributed on the area A with density p, 
and if there is no correlation (so that the probability distribu- 
tion for each point is pe{dn) — 1/^)- Moreover, we are as- 
suming here that the probabihty distribution for y{9) does not 
depend on 0. This is true only if a given configuration of points 
{©„} has the same probability as the translated set {0„ + </>}. 
This translation invariance, clearly, cannot hold exactly for fi- 
nite fields ri; on the other hand, again, as long as 9 is far from 
the boundary of the field, the probability distribution for y{d) 
is basically independent of 0. Note that in case of a field with 
masks, we also have to exclude in our analysis points close to 
the masks. 

Using py we can rewrite Eq. (Q) in a more compact form: 



(/>)) = p / dH,j{e,)w{e~e,) 

Jn 

^ Ay 



/o w[e-ei) + y 

where, we recall, p — N/A is the density of objects. For the 
following calculations, it is useful to write this equation as 

(/>))= / d^9'f{e')wie-e')c{w{e-e'))d'9' , (i3) 

Jn 

where C(w) the correcting factor, defined as 



C{w)=pI P^dy 

w + y 



(14) 



Finally, we will often call the combination WcffiO) = 
'w{9)C[w{6)), which enters Eq. (|l3|), effective weight. 

Interestingly, Eq. ( [l3| ) shows that the relationship between 
and f{9) is a simple convolution with the kernel Wcs- 
From the definition (^, we can also see that this kernel is nor- 
malized, in the sense that 



{fie))d^e^ / fie)d'9 

n Jn 



(15) 



In fact, if we consider a "flat" signal, for instance f{9) ~ 1, we 
clearly obtain = 1. On the other hand, from the prop- 

erties of convolutions, we know that the ratio between the l.h.s. 
and the rh.s. of Eq. ( [l5| ) is constant, independent of the func- 
tion f{0). We thus deduce that this ratio is 1, i.e. that Eq. ( [1^ 
holds in gener al. T he normalization of Wcff will be also proved 
below in Sect. 5.1 using analytical techniques. 

If Pyiy) is available, Eq. ( p^ can be used to obtain the 
expectation value for the smoothed map /. In order to ob- 
tain an expression for py we use Markov's method (see, e.g., 
Chandrasekhar 1943; see also Deguchi & Watson 1987 for an 
application to microlensing studies). Let us define the Laplace 
transforms of py and p^: 



W{s) = C[pM^ I 
Jo 



e ''^pw{w)dw 



A / e-'^-W d^0 



(16) 



Y{s)^C[py]{s) 



Hence py can in principle be obtained from the following 
scheme. First, we evaluate W{s) using Eq. (|l6|), then we cal- 
culate Y{s) from Eq. ([T7|), and finally we back-transform this 
function to obtain Py{y). 

3. Continuous limit 

So far we have considered a finite set Q, and a fixed number 
of objects N . In reality, one often deals with a non-constant 
number of objects, so that N is itself a random variable. [For 
example, if the objects we are studying are galaxies and if VI is 
a small field, the expected number of galaxies in our field will 
follow a Poissonian distribution with mean value pA, where p 
is the density of detectable galaxies in the sky.] Clearly, when 
we observe a field we will obtain a particular value for the 
number of objects N inside the field. However, in order to ob- 
tain more general results, it is convenient to consider an en- 
semble average, and take the number of observed objects as a 
random variable; the results obtained, thus, will be averaged 
over all possible values of N . 

A way to include the effect of a variable N in our frame- 
work is to note that, although we are observing a small area on 
the sky, each object could in principle be located at any posi- 
tion of the whole sky. Hence, instead of taking iV as a random 
variable and A fixed, we consider larger and larger areas of the 
sky and take the limit — > oo. In doing this, we keep the ob- 
ject density p = N/A constant. It is easily verified that the two 
methods (namely A fixed and N Poissonian random variable 
with mean pA, or rather A ^ oo with p — A/N fixed), lead 
to the same results. In the following, however, we will take the 
latter scheme, and let A goes to infinity; correspondingly we 
take il as the whole plane. 

From Eq. (|l6|) we see that W{s) is proportional to 1/A. For 
this reason it is convenient to define the function 



-sto(e) 



1 



d-'e 



(18) 



If A is finite, we have Q{s) = AW{s) — A. Thus we can write, 
in the hmit ^ — > oo. 



Y(s) = lim 



Q{s)p 

N 



N-l 



(19) 



This equation replaces Eq. (17) when N ^ oo 



/ e-^ypy{y)dy = [W{s)]''-' . (17) H(x) = 
Jo 



In order to further simplify the expression for the correcting 
factor C{w), we rewrite its definition as 

C(w)^pr^^dx, (20) 
Jo X 

where x = y + w, and the function is defined as 

Cniix) ^ il{x - w)py{x - w) . (21) 

Here H(a; — w) is the Heaviside function at the position w, i.e. 

'O ifx<0, 



1 otherwise. 



(22) 
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The Laplace transform of (^^ can be written as 



Z^{s) ^ C[Cu.]is) 



(23) 



In reality, for C{w) we need to evaluate an integral over 
Cto {^) From the properties of the Laplace transform we have 



J S 

and thus we find 

C{w) = pC[<:^{x)/x\ {0)^p Z^{s') ds' 

Jo 

1*00 

= p e-'^'^'r (s') ds' = C[pY] [w) . 



(24) 



(25) 



This important result, together with Eqs. (|l8]) and ([l9|), can be 
used to readily evaluate the correcting factor. We note that, for 
our purposes, there is no need to evaluate the probability dis- 
tribution py any more. This prevents us from calculating any 
inverse Laplace transform. 

4. Vanishing weighits 

Since w{0) > for every G f2, y is non-negative, so that 
Py{y) = if J/ < 0. In principle, however, we cannot exclude 
that the case y ~ has a finite probability. In particular, for 
finite support weight functions, Py{y) could include the contri- 
bution from a Dirac delta distribution centered on zero. 

Since y is the sum of weights at different positions, y — 
can have a finite probability only if w{9) vanishes at some 
points. In turn, if Pq — P{y = 0) is finite we could encounter 
situations where the numerator and the denominator of Eq. 
vanish. In such cases we could not even define our estimator 
/. We also note that, in the continuous limit, Pq is also the 
probability of having vanishing denominator in Eq. (js]). As a 
result, if Pq > 0, we have a finite probability of being unable 
to evaluate our estimator! 

So far we have implicitly assumed that Pq vanishes. 
Actually, we can explicitly evaluate this probability using the 
expression 



P{y = 0) = lim / Py{y')dy' 



(26) 



From the properties of the Laplace transform this expression 
can be written as 



P(y = 0) = lim y(s) = lim qP^^" 

S—^OO S— i-OO 

We now observe that 



lim 

S— i-OO 



-sw{x) 



1 



-1 ifw^O, 
otherwise. 



(27) 



(28) 



As a result, in the limit considered, —Q{s) approaches the area 
of the support of w. Calling this area tt^, we find finally 



P{y = 0) 



(29) 



The result expressed by Eq. (g9|) can actually be derived 
using a more direct approach. Since 'w{9) > 0, the condition 
y — requires that all weights except w{6i) are vanishing. 
In turn, this happens only if there is no object inside a neigh- 
borhood of the point considered. Since the area of this neigh- 
borhood is TTu,, the number of objects inside this set follows a 
Poisson distribution of mean p7r^, and thus the probability of 
finding no object is given precisely by Eq. (p9|). 

As mentioned before, Pq > is a warning that is same 
cases we cannot evaluate our estimator /. In order to proceed 
in our analysis allowing for finite-support weight functions, we 
decide to explicitly exclude in our calculations cases where 
w + y — 0: In such cases, in fact, we could not define /. 
In practice when smoothing the data we would mark as "bad 
points" the locations 6 where f{6) is not defined. In taking the 
ensemble average, then, we would exclude, for each possible 
configuration the bad points. In order to apply this pre- 

scription we need to modify Py, the probability distribution for 
y, and explicitly exclude cases with w + y = Q.ln other words, 
we define a new probabihty distribution for y given by 



Pyiy) 



\py{y) ifw^O, 
I [p,(2/)-Po<5(2/)]/(l-Po) ifu; = 0. 



(30) 



Hence, if w — Q, we set to zero P{y — 0) and then renor- 
malize the distribution. An important consequence of the new 
prescription is that the probability distribution for y no longer 
is independent of w. In fact, the probability P{y — 0) van- 
ishes for w = 0, while is finite (for a finite-support weight) if 
w ^ 0. Using Eq. (pO[ ) in the definition of Y{s) we then find 
(see Eqs. ^ and (pE) 



Y{s) 



A{w) 



l-Po 



1 



(31) 



where A{w) is 1 for = and vanishes for w > 0. 
Equation (|T]) replaces Eq. (jl^ for the cases when Pq > 0, 
i.e. for weight functions with finite support. 

In order to implement the new requirement w + y ^ 0, 
we still need to modify Eq. ( p^ [or, equivalently, Eq. ([l4|)]. 
In fact, in deriving that result, we have assumed that all objects 
can populate the area ft with uniform probability and in fact we 
have used a factor l/A^ to normalize Eq. (^. Now, however, 
we must take into account the fact that objects cannot make a 
"void" around the point 6. As a result, we need a further factor 
1/(1 — Pq) in front of Eqs. (|l2|) and ( 14 ). In summary, the new 
set of equations is 



Y{s) = ePQ^'^ - A{w) 
P 



d^e 

Po 



1 



C{w) 



1-Pr 



Jo 



e-'"'Y{s) ds 



(32) 
(33) 
(34) 



In the case where w{9) has infinite support (for example if w where, we recall, Pq = e vanishes for weight functions 



is a Gaussian), Pq vanishes as expected. 



with infinite support tt^ 
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5. Properties 

In this section we show a number of interesting properties of 

C{w) and (/>)). 



5. 1. Normalization 



As already shown [see above Eq. ( 13 )], is a simple con- 

volution of f{9) with Wcff- The normalization / of this convo- 
lution can be obtained from the expression 



From Eq. (B4b we find 



/ = 



1-Pf 



d^0 



Jn 



(35) 



(36) 



We first note that, in the general case, Y{s) depends on w{6). 
On the other hand, since the second term of is propor- 
tional to A(w) [see Eq. (^3|)], it does not contribute to the inte- 
gral. We can then evaluate first the integral over 9, obtaining 



(37) 



n 



and thus we have 



1 - ^0 7o 



Q'{s)ePQ^'^ ds 



1-Pn 



(38) 



Hence we conclude that the estimator is correctly normal- 
ized. 

5.2. Scaling 

It is easily verified that a simple scaling property holds for 
the effective weight. Suppose that we evaluate Wcs{9) for a 
given weight function w{9) and density p. If we rescale the 
weight function into k'^w{k9), and the density into k^p, where 
fc is a positive factor, the corresponding effective weight is also 
rescaled similarly to w, i.e. k'^Wcs{k9). 

We also recall here that, although a normalized weight 
function has been assumed [see Eq. (Q)], all results are clearly 
independent of the normalization of w. Hence, we can also con- 
sider a trivial scaling property: The weight function kw{9) has 
effective weight Wcs (9) independent of k. 

We anticipate that the effective weight is very close to the 
original weight w for large densities p (see below Sect. 5.4). 
The scaling properties discussed above suggest that the shape 
of the effective weight is actually controlled by the expected 
number of objects for which the weight is significantly different 
from zero. This justifies the definition of the weight area A of 
w: 



(39) 



The first factor in this definition ensures that A does not de- 
pend on the normalization of w{9). It is easily verified that 











A^ 


[ w{e)d^e 


7 


/ [w{9)]^d^e 




Jn 




Jn 




Fig. 1. The effective weight Wcs never exceed p. The original 
weight w(0) is the combination of two Gaussians with different 
widths ((Ti = 1 and a-z = 0.1). The central peak is severely 
depressed for relative low-densities. Note that the weight area 
is ^ ~ 12.2, so that a density of p = 0.2 coiTesponds to jV ~ 
2.4. 



A = TTw for a top-hat weight function. Correspondingly, we 
define the weight number of objects as AA = pA. Clearly, this 
quantity is left unchanged by the scalings considered above. 
Similar definitions can be provided for the effective weight 
WcS- Explicitely, we have 



A 



off 



w,s{e)d^e 



f [w,si9)]'d^e 

Jn 



(40) 



and A/'cff = pAcB- 

Numerical calculations for WeS show clearly that J\f, rather 
than p, is the key factor that controls how close the effective 
weight Wes{9) is to w{9) (cf. Figs. [l|, |, and |). 

5.3. Behavior of wC{w) 

We can easily study the general behavior of WcS- We first con- 
sider the case of infinite-support weights (Pq = 0). 

We first note that Y{s) > for every s, and thus C{w) 
decreases as w increases. On the other hand, we have 



Weff = wC{w) = pY{0) + pC[Y']{w) 



(41) 



Since Y'{s) = pQ'{s)Y{s) < is negative, we have that 
■wC{w) increases with w. This result shows that the effective 
weight WcS follows the general shape of the weight w, as ex- 
pected. Moreover, the fact that C{w) is monotonically decreas- 
ing implies that the effective weight Wcs is "broader" than w. 
For instance, we can say that there is a value wi for the weight 
such that Woff = wC{w) is not larger than w for w > wi, and 
not smaller than w for w < wi. In fact, since 0(111) is mono- 
tonic, the equation C{w) = 1 can have at most one solution. 
On the other hand, if C{w) < 1 (respectively, if C{w) > 1) for 
all w, then Wcff(0) < w{9) (wcs{9) > w{9)) for all 9. This 
inequality, however, cannot be true since both w and WcS are 
normalized. 
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The property just shown can be used to derive a relation 
between the weight area A and the effective weight area ^off ■ 
Let us evaluate the integral 

f [w{e)+w,siO)-2wi][wi9)-w,eie)]d^0 . (42) 
Jn 

This quantity is positive, since the integrand is positive (this is 
easily verified by distinguish the two cases w{6) > wi and 
w{6) < wi, and by noting that both factors in the integrand 
have the same sign). On the other hand, if we expand the prod- 
uct we obtain 

o<D^ [ [w{e)]^d^e~ [ [w,s{e)]^dH 

Jn Jn 

-2wi [ [wi9) ~ w,s{e)] d'^e 

Jn 



(43) 



The last relation holds because of the normalization of w{6) 
and Weff (0)- We thus have shown that the effective weight area 
^off is always larger than the original weight area A; analo- 
gously we have A/^ff > J^- This, clearly, is another indication 
that Wcs is "broader" than w. 

It is also interesting to evaluate the limits of ■wC{w) for 
small and large values of w. We have 



lim wC{w) = pY{0) = p 



(44) 



Since we know that wC{w) is monotonic, p is also a superior 
limit for the effective weight function. In other words, even if 
w has high peaks, the effective weight Wcti (that, we recall, 
is normalized) will never exceed the value p (see Fig. [T]). We 
stress here that, since Woff is normalized, its maximum value 
[which, in virtue of Eq. ( p4| ) does not exceed p] is a significant 
parameter. For example, using the relation Wcs{9) < p 'm the 
definition of ^off we find immediately 



A. 



cff 



[w,s{e)YdH<p I w,ffie)d:'d 

n 



(45) 



or, equivalently, Afcs > 1. In other words, no matter how small 
A is, the effective weight will always "force" to use at least one 
object. 

Equation ( |44| ) suggests also a local order-of-magnitude 
check for the effective weight: Assuming w{9) normalized, 
we expect the effective weight to be significantly different 
from uu for points where w{9) is of the order of p or larger. 



Equation (45), instead, provides a global criterion: The effec- 



tive weight will be significantly broader than w if JV is of the 
order of unity or smaller. As anticipated above, thus, Af is the 
real key factor that controls the shape of Woff with respect to w. 
The other limit for the effective weight is 



lim wC'{w) = p lim Y{s) = 



(46) 



Thus, as expected, the effective weight Wcs vanishes as w van- 
ishes. 

If w has finite support, then the situation is slightly differ- 
ent. Given the definition of Y{s), we expect for C{w) a dis- 
continuity for w = 0. Apart from this difference, the behavior 



of C{w) and of wC{w) is similar to the case considered above, 
namely C{w) is monotonically decreasing and wC{w) is in- 
creasing. We also have 



lim 

w — >oo 



wC(w) 



l-Pn 



(47) 



which is similar to Eq. (^4|). Note that, as expected from the 
normalization of WcS, t^wP/{^ — Pq) > 1 for all densities. We 
also find 

pPo 



lim wC'{w 



l-Po 



(48) 



In other words, the effective weight wC{w) does not vanish for 
small w. If, instead, we take w = 0, then we have wC(w) ~ 0. 
This discontinuity, related to the term A(w) in Eq. (P3|), is a 
consequence of a number of properties for the effective weight: 
(i) WcS is normalized; (ii) Wcs is broader than w; (iii) Wcs has 
the same support as w. We thus are forced to have a disconti- 
nuity for the effective weight. 

The result obtained above is also convenient for simplifying 
equations for finite-support weight functions. In fact, for w > 
we can clearly drop the last term of Eq. (33), thus recovering 
Eq. (p^; if, instead, w = 0, we can directly set wC{w) — 0, 
without further calculations. 

5.4. Limit of Inigin and low densities 

The final result considered in this section is the behavior of the 
correcting factor C{w) in the limit p ^ oo and p ^ 0. 

We observe that Q{s) < and moreover |Q(s)| increases 
with s. Thus, if s is large, eP'^'-"^ vanishes quickly when p 
oo. As a result, in the limit of large densities we are mainly 
interested in Q{s) with s small. Expanding Q{s) around s — 
0+, we find 



k=l 

where Sk is the fc-th moment of w: 



S, 



f [w{e)]''d^e 

Jn 



(49) 



(50) 



The normalization of w clearly implies 5*1 = 1. Assuming w > 
0, to first order we have then Y{s) ~ e^'*'', so that 

1 r°° .... . 1 P 



C{w) 



1-Po 



(51) 



l-Pop + w 

This expression gives the correcting factor at the first order in 
If u> has infinite support the expression reduces to C(z«) = 
l/[l + w/ p\. Note that the quantity w/ p ^ 1/M is related 
the weight number, i.e. the expected number of objects which 
contribute significantly to the signal, for which the weight is 
not exceedingly small. Finally, at the zero order in the limit 
p oo, C{w) converges to unity. In this case the map f{0) 
is expected to be a smoothing of f{0) with the same weight 
function w. 

In the limit p ^ 0, we have Y{s) 1, and thus 
CH-^^. (52) 
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0.16 




Fig. 2. The effective weight Wcs can be well approximated us- 
ing the expansion (|66|). This plot shows the behavior of the rt-th 
order expansion for a Gaussian weight function with unit vari- 
ance (see Eq. (^)). The density used, p — 0.5, corresponds to 
J\f ~ 6.3. The convergence is already extremely good at the 
second order; for larger values of Af the expansion converges 
more rapidly. 



In the same limit, Pg — 1 ^ t^wP, so that we find 



Wcs — wC{w) 



1 



(53) 



Recalling that wC{w) vanishes for w — 0,we conclude that the 
effective weight converges to a top-hat function with support 
TTw if w{0) has finite-support (see Fig. Qfor an example). 

6. Moments expansion 

In the last section we have obtained an analytical expansion for 
Q{s) which has then been used to obtain a first approximation 
for the correction function C{w) valid for large densities p. 
Unfortunately, we have been able to obtain a simple result for 
C{w) only to first order. Already at the second order, in fact, 
the correcting function would result in a rather complicated ex- 
pression involving the error function erf. 

Actually, there is a simpler approach to obtain an expansion 
of C{w) at large p using the moments of the random variable 
y. Given the definition (^ for y, we expect that y = (y), the 
average value of this random variable, increases linearly with 
the density p of objects. Similarly, for large p the relative scatter 
{{y — vT) Iv"^ is expected to decrease. In fact, y is the sum of 
several independent random variables, and thus, in virtue of the 
central limit theorem, it must converge to a Gaussian random 
variable with appropriate average and variance. 

Since the relative variance of y decreases with p, we can 
expand y in the denominator in Eq. (|l4|), obtaining 



Jo y + w 



fc=0 

1 



k y-y 



k=0 



(y + vj) 



fc+ 



y + w 

jiiy-y)') 



(54) 



where we have used the definition of the moments of y: 

y^{y)= / Pviy)y<iy , (55) 

Jo 

poo 

((y-y)')= / Py{y){y~yfdy. (56) 

Jo 

In other words, if we are able to evaluate the moments of y we 
can obtain an expansion of C{w). Actually, the "centered" mo- 
ments can be calculated from the "un-centered" ones, defined 
by 



iy') 



p,(y)y'=dy=.(-l)'=rW(0) 



(57) 



Here we have used the notation Y^''^ (0) for the fc-th derivative 
of Y{s) evaluated at s = 0. Using Eq. (|l9|) we can explicitly 
write the first few derivatives 



1 



r(0) 
y'(0) 

r"(o) 

y"'(0) = pQ"'{Q) + 3p2Q"(0)Q'(0) + p3 [g'(0)]^ 
y(*H0) = PQ^^HO) + Vg"'(0)Q'(0) + 3p^[Q"{0)y 



pQ"{0) + p'[Q\ 



Qp'Q"{0)[Q'{0)Y + p'[Q'{Q)]' 



(58) 
(59) 

(60) 

(61) 



(62) 



A nice point here is that, in principle, we can evaluate all the 
derivatives of Y{s) in terms of derivatives of Q{s) without any 
technical problem. Moreover, the derivatives of Q{s) in zero 
are actually directly related to the moments of In fact we 
have 

(0) = (-1)'= / [w{e)] ' = i-l)'^Sk . (63) 
Jn 

This simple relation allows us to express the moments of y in 
terms of the moments of w. For the first "centered" moments 
we find in particular 



y^{y) = p, {{v-yf) = pS2, 
{{y - vf) - PS^ , {{y - yf) = pS^ + ip'sl 



(64) 
(65) 



Hence, we finally have 

P^S2 



C{w) 



P 



p + w {p + w) 



P^S, 

[p + wY 



+ 



p'Si 



{p + wy 



(66) 



The first term if this expansion, p/{p + w), has already been 
obtained in Eq. (^1|). Other terms represents higher order cor- 
rections to C{w). In Fig. H we show the result of applying this 
expansion to a Gaussian weight. 

In closing this section we note that, regardless to the value 
of TTuj, Pq = e~P^^ is vanishing at all orders for p oo, 
and thus we cannot see the pecuUarities of finite-support weight 
functions here. 
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7. Examples 

In this section we consider three typical examples of weight 
functions, namely a top-hat function, a Gaussian, and a 
parabolic weight function. For simplicity, in the following we 
will consider weight functions with fixed "width." The results 
obtained can then be adapted to weight functions with different 
widths using the scaling property (Sect. 5.2). 



7. 1. Top-hat 

The simplest case we can consider for w{9) is a top-hat func- 
tion of unit radius, which can be written as 



In this case we immediately find for w > 

g(s) = ^(e-^/^-i) , 

r(s)=exp[7rp(e-'*/^-l)] . 



(67) 



(68) 
(69) 



We now note that since w{9) is either or 1/ vr, we just need to 
evaluate C{1/tt). We then find 



Jo 



(70) 



as expected. 

For the top-hat function we can also explicitly obtain the 
probability distribution for y. If w > we have 



Py 



nnl 



■S{y - n/n) 



(71) 



and thus 



C{w) = 



P 



Pyiy) 



dy 



1 - -Po 7o w + y 

pPo ^ {p^r 



7r(l — Pn) + n/7i)n\ 

^ ' n— 



(72) 



From this expression we easily obtain C(l/7r) = 1. Moreover, 
we can evaluate the two limits 



lim wC{w) 



lim wC'{w) 



P 



1-Po ' 
pPo 



(73) 
(74) 



thus regaining the results of Sect. 5.3 



0.16 
0.14 - 
0.12 - 
0.10 — 
0.08 - 
0.06 - 
0.04 - 
0.02 - 

— 





p = 0.2 
p = 0.5 
p = 1.0 
p = 2.0 



0.5 



1.5 

\0\ 



2.5 



Fig. 3. Effective weight function corresponding to a Gaussian 
weight function. The original weight function is a normal- 
ized Gaussian of unit variance, with weight area A ~ 12.5. 
Significantly broader effective weights are obtained if the 
weight number is smaller than M < 10. 




Fig. 4. Effective weight function corresponding to a Gaussian 
weight function. Note the discontinuity at \9\ = 1, correspond- 
ing to the boundary of the support of w. The original weight 
function is a normalized parabolic function with weight area 
A ~ 2.4. Note that this weight area is significantly smaller 
than the ones encountered in previous examples. For low den- 
sities, Woff conver ges t o a top-hat function, in accordance with 
the results of Sect. 5.4 



7.3. Parabolic weight 

As the last, example we consider a parabolic weight function 
with expression 



7.2. Gaussian 

A weight function commonly used is a Gaussian of the form 
1 



^0) = — exp(-|0|V2) 



(75) 



Unfortunately, we cannot explicitly integrate Q{s) and thus we 
are unable to obtain a finite expression for C{w). The results 
of a numerical calculations are however shown in Fig. |[ 



w{e) 



(l-l^n for|0|<l, 



otherwise 



We then find 

2 _c-2s/7r 



2s 



(76) 



(77) 



Unfortunately, we cannot proceed analytically and determine 
C(w). We thus report the results of numerical integrations in 
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Fig. ^ Note that, as expected, the resulting effective weight 
has a discontinuity at \9\ — 1. 

8. Conclusions 

In this paper we have studied from a statistical point of view the 
effect of smoothing irregularly sampled data. The main results 
obtained can be summarized in the following points. 

1. The mean smooth map, is a convolution of the un- 
known field f{6) with an effective weight Wcff- 

2. We have provided simple expressions to evaluate the ef- 
fective weight. These expressions can be easily used, for 
example, to obtain numerical estimates of Wcg. 

3. The effective weight Wcff{6) and the weight function 
w{6) share the same support and have a similar "shape." 
However, Wes is broader than w, expecially for low den- 
sities of objects; moreover, Wcs has a discontinuity on the 
boundary of the support. 

4. We have shown that the density of objects p (or p/ (1 — Pq) 
for finite-support weight functions) is a natural upper limit 
for the effective weights. 

5. The weight number Af has been shown to be the key factor 
that controls the convergence of Wctf to w. We have also 
shown that TVcff > 1. 

6. The effective weight converges to w{9) for large densities 
p, and to a top-hat function for low densities. 

7. We have provided an analytic expansion for Wcs which is 
shown to converge quickly to the exact weight function. 

8. Finally, we have considered three typical examples and 
shown the behavior of Wcs for different densities. 

Given the wide use in astronomy of the smoothing technique 
considered in this paper, an exact statistical characterization of 
the expectation value of the smoothed map is probably inter- 
esting per se. 

Finally, we notice that other methods different from 
Eqs. (|]) or (Q) can be used to obtain continuous maps from 
irregularly sampled data. In particular, triangulation techniques 
can represents an interesting alternative to the simple weighted 
average considered here (see, e.g., Bernardeau & van de 
Weygaert 1996; Schaap & van de Weygaert 2000). 

Acknowledgements. We would like to thank the Referee for com- 
ments and suggestions that enabled us to improve this paper. 

Appendix A: Object intrinsic weight 

The simple weighting scheme considered in this paper [see 
Eq. (||)] is actually the base of several similar schemes with 
slightly different properties. In this paper, for simplicity and 
clarity, we have confined the discussion to the simple case, 
which is already rich of peculiarities and unexpected results 
(for example, the behavior in case of vanishing weights). In 
this appendix, however, we will describe a slightly more com- 
plicated estimator often used in astronomy. 

Suppose, as for the Eq. (||), that we can measure a given 
field f{9) at some positions 0„ inside a set ft. Suppose also 
that we decide to use, for each object, a weight u„ > which 



is independent of the position 0„ (the weight u„ could however 
depend on other intrinsic properties of the object, such as its 
luminosity or angular size). We then replace Eq. ^ with 



Sn^l fnUnW{6 - Or. 



f{0) 



(A.1) 



J2n=l UnW{6 - 0n) 

A typical situation for which this estimator is useful, is when 
some error estimates {cr„} are available for each object. Then, 
if we set u„ = l/u^j, we obtain an estimator which optimizes 
the signal-to-noise ratio. 

Since the quantities do not depend on the position, we 
can study the statistical properties of the estimator ik.l) pro- 



vided that the probability distribution Pu{u) of u is available. 
In particular we obtain [cf. Eq. (0)] 



(/») - ^ 



N 



dUN Pu{UNj 7^]v 



Y.n=l UnW{6 - On) 



Similarly to Eq. (H), we define 



N 
n=2 



N 
n=2 



(A.2) 



(A.3) 



within = Unw{On). Using J/ and and defining the proba- 
bility distributions py (y) and p„ (v), we can write the analogous 
of Eqs. (n^ and (pi]). In this way we obtain results similar to 
Eqs. ([T2[-p^ that we do not report here. Finally then we have 



V{s) = C[p^]{s) = / e->„(«)d« 
Jo 

^ /-oo 



A 



Pu{u) du 







(0) 



n 



^ypy{y)dy^[Vis)] 



(A.4) 



(A.5) 



In other words, we basically recover Eqs. ( |l6| ) and ( [l7| ) with 
the significant difference that now the Laplace transform V{s) 
of Py (v) is given by the superpositions of functions like W{us) 
weighted with the probability distribution Pu{u). Note that all 
functions W{us) have the same shape but differ for the scaling 
of the independent variable. In case where Pu{u) — 5{u — 1) 
is Dirac's delta distribution centered in 1 we exactly reproduce 
Eqs. ( |l6[ ) and (|l7[), as expected. 

The continuous limit (cf. Sect. |^) does not present partic- 
ular difficulties. The first significant change regards Eq. (^Sj), 
which now becomes 



R{s) = / Pu{u) du 

Jo Jn 



Pu{u)Q{us) du . 



0(6) 



d^e 



(A.6) 



CoiTespondingly, Y{s) — exp[/9i?(s)] . Finally, writing the ef- 
fective weight as Wcs — wC{w), we have 



C{w) 



Pu{u)B(uw) du 



B{w) = C[pY]{w) 



(A.7) 
(A.8) 
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The equations written here can be used to practically eval- 
uate B{w) (and thus Wcs{6)), but also to derive the proper- 
ties of the effective weight, as done in for the simple estima- 
tor (|). Here we do not caiTy out the calculations, since they 
are straightforward modifications of the calculations of Sect. ^ 
moreover the results obtained are basically identical to the ones 
reported in that section. 
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